Localization of Difficult-to-Translate Phrases
نویسندگان
چکیده
This paper studies the impact that difficult-totranslate source-language phrases might have on the machine translation process. We formulate the notion of difficulty as a measurable quantity; we show that a classifier can be trained to predict whether a phrase might be difficult to translate; and we develop a framework that makes use of the classifier and external resources (such as human translators) to improve the overall translation quality. Through experimental work, we verify that by isolating difficult-to-translate phrases and processing them as special cases, their negative impact on the translation of the rest of the sentences can be reduced.
منابع مشابه
Locating and Reducing Translation Difficulty
The challenge of translation varies from one sentence to another, or even between phrases of a sentence. We investigate whether variations in difficulty can be located automatically for Statistical Machine Translation (SMT). Furthermore, we hypothesize that customization of a SMT system based on difficulty information, improves the translation quality. We assume a binary categorization for phra...
متن کاملLanguage Model Adaptation for Difficult to Translate Phrases
This paper investigates the idea of adapting language models for phrases that have poor translation quality. We apply a selective adaptation criterion which uses a classifier to locate the most difficult phrase of each source language sentence. A special adapted language model is constructed for the highlighted phrase. Our adaptation heuristic uses lexical features of the phrase to locate the r...
متن کاملTranslating Named Entities Using Monolingual and Bilingual Resources
Named entity phrases are some of the most difficult phrases to translate because new phrases can appear from nowhere, and because many are domain specific, not to be found in bilingual dictionaries. We present a novel algorithm for translating named entity phrases using easily obtainable monolingual and bilingual resources. We report on the application and evaluation of this algorithm in transl...
متن کاملTranslating Phrases in Neural Machine Translation
Phrases play an important role in natural language understanding and machine translation (Sag et al., 2002; Villavicencio et al., 2005). However, it is difficult to integrate them into current neural machine translation (NMT) which reads and generates sentences word by word. In this work, we propose a method to translate phrases in NMT by integrating a phrase memory storing target phrases from ...
متن کاملExample-Based Machine Translation of Short Phrases Using the Context Equivalence Principle
In this paper we consider machine translation in a very specific setting of computerassisted software localization. This setting imposes certain requirements on the kind of translation assistance expected from the computer, and we believe that simple examplebased techniques suit best for the task. We propose a very simplistic method, that exploits the assumption of context-equivalence common to...
متن کامل